AITopics | term document matrix

Collaborating Authors

term document matrix

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

information retrieval document search using vector space model in R

#artificialintelligenceDec-9-2020, 11:30:38 GMT

Now calculate cosine similarity between each document and each query. For each query sort the cosine similarity scores for all the documents and take top-3 documents having high scores.

query, term document matrix, vector, (14 more...)

#artificialintelligence

Country:

North America > United States > Illinois > Cook County > Chicago (0.05)
North America > United States > Hawaii > Honolulu County > Honolulu (0.05)

Industry:

Government > Regional Government > North America Government > United States Government (1.00)
Law (0.96)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Supervised Learning > Representation Of Examples (0.45)

Add feedback

R

#artificialintelligenceMar-29-2018, 14:11:40 GMT

If only love were so simple – How to graph a heart using R from a fun site called Date By Number. I highly recommend checking it out. If only love were so simple – How to graph a heart using R from a fun site called Date By Number. I highly recommend checking it out.

artificial intelligence, decision tree, machine learning, (17 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.57)

Add feedback

Semantic analysis of webpages with machine learning in Go · James Bowman

#artificialintelligenceOct-12-2017, 02:49:01 GMT

I spend a lot of time reading articles on the internet and started wondering whether I could develop software to automatically discover and recommend articles relevant to my interests. There are various aspects to this problem but I have decided to concentrate first on the core part of the problem: the analysis and classification of the articles. To illustrate the problem, lets consider the following string representing an article for the purpose of this example. We will attempt to use this article as a query to find similar or related articles from the following set of strings (usually referred to as a'corpus'), where each string also represents an article. The approaches we will consider for this example will work with any type of query equally whether the query is itself an article as above or simply a short string of words.

artificial intelligence, machine learning, natural language, (18 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.36)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.30)

Add feedback

Optimising algorithms in Go for machine learning - Part 2 · James Bowman

#artificialintelligenceJun-15-2017, 11:00:22 GMT

This is the second in a series of blog posts sharing my experiences working with algorithms and data structures for machine learning. These experiences were gained whilst building out the nlp project for LSA (Latent Semantic Analysis) of text documents. In Part 1 of this series, I explored alternative approaches for representing and applying TF-IDF transforms for weighting term frequencies across document corpora. We tested the approaches using Go's inbuilt benchmark functionality and found that our optimisations materially improved not just memory consumption but also performance (reducing memory consumption and processing time from 7 GB and 41 seconds to 250 KB and 0.8 seconds respectively). In this blog post I shall explore other areas for optimisation, seeking to further reduce memory consumption and processing time.

artificial intelligence, machine learning, matrix, (14 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Semantic analysis of webpages with machine learning in Go · James Bowman

#artificialintelligenceMar-22-2017, 08:00:44 GMT

artificial intelligence, machine learning, natural language, (18 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.36)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.30)

Add feedback

A Case Study in Text Mining: Interpreting Twitter Data From World Cup Tweets

Godfrey, Daniel, Johns, Caley, Meyer, Carl, Race, Shaina, Sadek, Carol

arXiv.org Machine LearningAug-21-2014

Cluster analysis is a field of data analysis that extracts underlying patterns in data. One application of cluster analysis is in text-mining, the analysis of large collections of text to find similarities between documents. We used a collection of about 30,000 tweets extracted from Twitter just before the World Cup started. A common problem with real world text data is the presence of linguistic noise. In our case it would be extraneous tweets that are unrelated to dominant themes. To combat this problem, we created an algorithm that combined the DBSCAN algorithm and a consensus matrix. This way we are left with the tweets that are related to those dominant themes. We then used cluster analysis to find those topics that the tweets describe. We clustered the tweets using k-means, a commonly used clustering algorithm, and Non-Negative Matrix Factorization (NMF) and compared the results. The two algorithms gave similar results, but NMF proved to be faster and provided more easily interpreted results. We explored our results using two visualization tools, Gephi and Wordle.

artificial intelligence, machine learning, social media, (15 more...)

arXiv.org Machine Learning

1408.5427

Country: North America > United States > North Carolina (0.46)

Genre: Research Report (0.70)

Industry:

Information Technology > Services (0.84)
Leisure & Entertainment (0.67)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.68)

Add feedback